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This paper reviews the measurements of galaxy correlations at high redshifts, 
and discusses how these may be understood in models of hierarchical gravita- 
tional collapse. The clustering of galaxies at redshift one is much weaker than at 
present, and this is consistent with the rate of growth of structure expected in 
an open universe. If = 1, this observation would imply that bias increases at 
high redshift, in conflict with observed M/L values for known high-z clusters. At 
redshift 3, the population of Lyman-limit galaxies displays clustering which is of 
similar amplitude to that seen today. This is most naturally understood if the 
Lyman-limit population is a set of rare recently-formed objects. Knowing both 
the clustering and the abundance of these objects, it is possible to deduce em- 
pirically the fluctuation spectrum required on scales which cannot be measured 
today owing to gravitational nonlinearities. Of existing physical models for the 
fluctuation spectrum, the results are most closely matched by a low-density spa- 
tially flat universe. This conclusion is reinforced by an empirical analysis of CMB 
anisotropics, in which the present-day fluctuation spectrum is forced to have the 
observed form. Open models are strongly disfavoured, leaving ACDM as the most 
successful simple model for structure formation. 



1 . Background 

(a) Evolution of mass fluctuations 

Attempts to understand the evolution of structure in the galaxy distribution 
start with the assumption that this evolution is directly related to gravitationally- 
driven evolution of the dark matter. This is a well understood problem, with the 
following features: 

(1) Linear evolution. The fractional density contrast 5 evolves according to 
linear perturbation theory as 

5^a{t)g{^)- 9{^)^[^i ""^^^ (LI) 

where a{t) = (1 + z)~^ is the scale factor and the growth suppression factor g{^) 
is much less important for k = Q models; the universe only discovers rather late 
that there is a nonzero lambda (Lahav et al. 1991; Carroll, Press &: Turner 1992). 

(2) Stable clustering. In the opposite extreme of highly nonlinear clustering, 
there is Peebles' concept of stable clustering, in which virialized objects maintain 
fixed proper size and merely change their separation with time. This leads to the 



Phil. Trans. R. Soc. Land. A (1997) (Submitted) 
Printed in Great Britain 1 



1997 Royal Society Typescript 
1^ Paper 



2 



J. A. Peacock 



common parameterization for the correlation function in comoving coordinates: 

ar,z) = [r/ro]-Hl + zr(-^-^+'\ (1.2) 

where e = is stable clustering; e = 7 — 3 is constant comoving clustering; 
e = 7 — lisr2 = l linear-theory evolution. 

Although this equation is frequently encountered, it is probably not appli- 
cable to the real world, because most data inhabit the intermediate regime of 
1 ^ C ~ 100. Peacock (1997) showed that the expected evolution in this quasilin- 
ear regime is significantly more rapid: up to e ~ 3. 

(&) General aspects of bias 

Of course, there are good reasons to expect that the galaxy distribution will 
not follow that of the dark matter. The main empirical argument in this direction 
comes from the masses of rich clusters of galaxies. It has long been known that 
attempts to 'weigh' the universe by multiplying the overall luminosity density by 
cluster M/L ratios give apparent density parameters in the range O, ~ 0.2 to 0.3 
(e.g. Carlberg et al. 1996). 

An alternative argument is to use the abundance of rich clusters of galaxies in 
order to infer the rms fractional density contrast in spheres of radius 8 Mpc. 
This calculation has been carried out several different ways, with general agree- 
ment on a figure close to 

(White, Efstathiou & Frcnk 1993; Eke, Cole & Frcnk 1996; Viana & Liddlc 1996). 
The observed apparent value of ag in, for example, APM galaxies (Maddox, Efs- 
tathiou Sz Sutherland 1996) is about 0.95 (ignoring nonlinear corrections, which 
are small in practice, although this is not obvious in advance). This says that 
= 1 needs substantial positive bias, but that $7 < 0.4 needs antihias. Although 
this cluster normalization argument depends on the assumption that the density 
field obeys Gaussian statistics, the result is in reasonable agreement with what 
is inferred from cluster M/L ratios. 

What effect does bias have on common statistical measures of clustering 
such as correlation functions? We could be perverse and assume that the mass 
and light fields are completely unrelated. If however we are prepared to make the 
more sensible assumption that the light density is a nonlinear but local function 
of the mass density, then there is a very nice result due to Coles (1993): the bias 
is a monotonic function of scale. Explicitly, if scale-dependent bias is defined as 

KO^[WyW/Cma.s(r)]^/^ (1.4) 

then 6(r) varies monotonically with scale under rather general assumptions about 
the density field. Furthermore, at large r, the bias will tend to a constant value 
which is the linear response of the galaxy-formation process. 

There is certainly empirical evidence that bias in the real universe does work 
this way. Consider Fig. 1, taken from Peacock (1997). This compares dimen- 
sionless power spectra (A^(/c) = da'^/dlnk) for IRAS and APM galaxies. The 
comparison is made in real space, so as to avoid distortions due to peculiar veloc- 
ities. For IRAS galaxies, the real-space power was obtained from the the projected 
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Figure 1. The real-space power spectra of optically-selected APM galaxies (solid circles) and 
IRAS galaxies (open circles), taken from Peacock (1997). IRAS galajcics show weaker clustering, 
consistent with their suppression in high-density regions relative to optical galaxies. The relative 
bias is a monotonic but slowly-varying function of scale. 

correlation function: 

/oo 
e[(r2 + x2)i/2] dx. (1.5) 
-oo 

Saunders, Rowan-Robinson &; Lawrence (1992) describe how this statistic can 
be converted to other measures of real-space correlation. For the APM galaxies, 
Baugh & Efstathiou (1993; 1994) deprojected Limber's equation for the angular 
correlation function w{9) (discussed below). These different methods yield rather 
similar power spectra, with a relative bias that is perhaps only about 1.2 on large 
scale, increasing to about 1.5 on small scales. The power-law portion for k > 
0.2 /iMpc^"*^ is the clear signature of nonlinear gravitational evolution, and the 
slow scale-dependence of bias gives encouragement that the galaxy correlations 
give a good measure of the shape of the underlying mass fluctuation spectrum. 



2. Observations of high-redshift clustering 

(a) Clustering at redshift 1 

At z = 0, there is a degeneracy between $7 and the true normalization of the 
spectrum. Since the evolution of clustering with redshift depends on fi, studies at 
higher redshifts should be capable of breaking this degeneracy. This can be done 
without using a complete faint redshift survey, by using the angular clustering 
of a flux-limited survey. If the form of the redshift distribution is known, the 
projection effects can be disentangled in order to estimate the 3D clustering at 
the average redshift of the sample. For small angles, and where the redshift shell 
being studied is thicker than the scale of any clustering, the spatial and angular 
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correlation functions are related by Limber's equation (e.g. Peebles 1980): 

roo poo 

w{e)= yU\y)Ciy)dy ^{[x^ + y^e^]'/^z) dx, (2.1) 

JO J-oo 

where y is dimensionless comoving distance (transverse part of the FRW metric 
is [R{t)y dO]'^) , and C{y) = [1 — A;y^]~^/^; the selection function for radius y is 
normalized so that / y'^(/){y)C{y) dy = 1. Less well known, but simpler, is the 
Fourier analogue of this relation: 

A^(i^) = A\[K/y],z) y'4>\y)C{y) dy, (2.2) 

where Ag is the angular power spectrum and K is angular wavenumber (Kaiser 
1992). In either case, the angular clustering tends to be sensitive to the spatial 
clustering at the redshift, z, at which y'^(p{y) peaks. 

This relation has been used by many workers in order to interpret angular 
clustering of faint galaxies (e.g. Efstathiou et al. 1991; Neuschaefer, Windhorst & 
Dressier 1991; Couch et al. 1993; Roche et al. 1993). The general conclusion was 
always that clustering seemed to be weaker in the past, but the rate of evolution 
was not very well tied down, owing to uncertainties in the redshift distribution 
for faint galaxies, plus the fact that projection effects leave only a very small 
clustering signal. The uncertainties in interpreting w{6) for faint galaxies were 
first convincingly overcome by the CFRS team, who assembled a large enough 
redshift survey to construct the correlation function directly out to z ~ 1 (Le 
Fevre et al. 1996). Their results were well described by ro ^ 2h~^ Mpc at z = I, 
i.e. evolution at about the e = 1 rate. Other groups have found similar results 
(e.g. Carlberg et al. 1997) - although Carlberg's presentation at this meeting 
argued for slightly slower evolution. Although this rate of evolution is in accord 
with the expected linear-theory evolution in an = 1 model, the discussion of 
section 1(a) shows that such a result is in fact more consistent with lower density 
models. Since the data are in the quasi-linear regime, the expected evolution in 
a critical-density universe would be much more rapid. 

The observed clustering at z ~ 1 is thus larger than would be expected if 
J7 = 1. There is no difficulty with this, since we shall see below that bias is 
expected to evolve in the sense of being higher at early times. However, consider 
the implications for cluster M/L ratios: we have already seen that the observed 
degree of bias at z = must reduce these by about a factor of 5 in the cores of 
rich clusters. If the bias at z = 1 is significantly greater than today, this trend 
must continue, so that the apparent Ti' from high-z clusters would be expected 
to be very small. Conversely, if Q is low today, the z = 1 clustering would be 
nearly unbiased and we would expect to see the true at that time - which 
should have evolved to be close to unity. So, this leaves the nice paradox that the 
way to prove f2 = 1 today is to observe a very small Ti' at z = 1 - and vice versa. 

It has recently become possible to carry out this test, through the detection 
of massive clusters at redshifts near unity. Many of these have been found through 
X-ray detections, which almost guarantees a high virial temperature and hence 
a high mass (e.g. the EMSS sample: Henry et al. 1992) . The existence of massive 
clusters at high redshift is a potential problem for high-density models, owing to 
the more rapid evolution of the mass fluctuations in this case, and it has been 
claimed that = 1 is ruled out (e.g. Luppino & Gioia 1992; Henry 1997). How- 
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ever, the M/L argument is more powerful since only a single cluster is required, 
and a complete survey is not necessary. Two particularly good candidates at 
z ~ 0.8 are described by Clowe et al. (1998); these are clusters where significant 
weak gravitational-lensing distortions are seen, allowing a robust determination 
of the total cluster mass. The mean F-band M/L in these clusters is 230 Solar 
units, which is close to typical values in z = clusters. However, the comoving 
F-band luminosity density of the universe is higher at early times than at present 
by about a factor (1 + ^)^-^ (Lilly et al. 1996), so this is equivalent to M/L ~ 1000, 
implying an apparent 'Jl' of close to unity. In summary, the known degree of bias 
today coupled with the moderate evolution in correlation function back to z = 1 
implies that, for = 1, the galaxy distribution at this time would have to consist 
very nearly of a 'painted-on' pattern that is not accompanied by significant mass 
fiuctuations. Such a picture cannot be reconciled with the healthy M/L ratios 
that are observed in real clusters at these redshifts, and this seems to be a strong 
argument that we do not live in an Einstein-de Sitter universe. 

(b) Clustering of Lyman-limit galaxies at redshift 3 

The most exciting recent development in observational studies of galaxy clus- 
tering is the detection by Stcidel et al. (1997) of strong clustering in the popula- 
tion of Lyman-limit galaxies at z ~ 3. The evidence takes the form of a redshift 
histogram binned at Az = 0.04 resolution over a field 8.7' x 17.6' in extent. For 
= 1 and z = 3, this probes the density field using a cell with dimensions 

cell = 15.4 X 7.6 x 15.0 [h'^ Mpc]^. (2.3) 

Conveniently, this has a volume equivalent to a sphere of radius 7.5 Mpc, so 
it is easy to measure the bias directly by reference to the known value of ug. Since 
the degree of bias is large, redshift-space distortions from coherent infall are small; 
the cell is also large enough that the distortions of small-scale random velocities 
at the few hundred kms~^ level are also small. Using the model of equation 
(11) of Peacock (1997) for the anisotropic redshift-space power spectrum and 
integrating over the exact anisotropic window function, the above simple volume 
argument is found to be accurate to a few per cent for reasonable power spectra: 

o-ccu - = 3) 0-7.5(2: = 3), (2.4) 

defining the bias factor at this scale. The results of section 1 (see also Mo & 
White 1996) suggest that the scale-dependence of bias should be weak. 

In order to estimate o"ccii, simulations of synthetic redshift histograms were 
made, using the method of Poisson-sampled lognormal realizations described by 
Broadhurst, Taylor &: Peacock (1995): using a statistic to quantify the nonuni- 
formity of the redshift histogram, it appears that cjccll — 0.9 is required in order 
for the field of Steidel et al. (1997) to be typical. It is then straightforward to ob- 
tain the bias parameter since, for a present-day correlation function ^(r) oc r~^-^, 

a7.5(z = 3) = C78 X [8/7.5]^-®/2 ^ 1/4 ^ o.l46, (2.5) 

implying 

6(2 = 3 I O = 1) ~ 0.9/0.146 ~ 6.2. (2.6) 

Steidel et al. (1997) use a rather different analysis which concentrates on the 
highest peak alone, and obtain a minimum bias of 6, with a preferred value of 8. 
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They use the Eke et al. (1996) value of cjg = 0.52, which is on the low side of the 
published range of estimates. Using ag = 0.55 would lower their preferred b to 
7.6. Note that, with both these methods, it is much easier to rule out a low value 
of b than a high one; given a single field, it is possible that a relatively 'quiet' 
region of space has been sampled, and that much larger spikes remain to be found 
elsewhere. A more detailed analysis of several further fields by Adelbcrger et al. 
(1998) in fact yields a bias figure very close to that given above, so the first field 
was apparently not unrepresentative. 

Having arrived at a figure for bias if 17 = 1, it is easy to translate to other 
models, since cjccll is observed, independent of cosmology. For low 0, models, 
the cell volume will increase by a factor [Sl{r) dr]/[S'^{ri) dri]; comparing with 
present-day fluctuations on this larger scale will tend to increase the bias. How- 
ever, for low Q, two other effects increase the predicted density fluctuation at 
z = 3: the cluster constraint increases the present-day fluctuation by a factor 
and the growth between redshift 3 and the present will be less than a 
factor of 4. Applying these corrections gives 

b{z = 3\Q = 0.3) ^ r 0.42 (open) 

b{z = 3\n = l) 1 0.60 (flat) ' ^ ' ^ 

which suggests an approximate scaling as 6 oc ^l^-'^'^ (open) or fi''-^^ (flat). The 
significance of this observation is thus to provide the first convincing proof for 

the reality of galaxy bias: for J7 0.3, bias is not required in the present universe, 
but we now see that 6 > 1 is needed at z = 3 for all reasonable values of O,. 

(c) Clustering of high-redshift AGN 

The strength of clustering for Lyman-limit galaxies fits in reasonably well 
with what is known about clustering of AGN. A comoving correlation length of 
ro — 6.5 Mpc has been measured for radio-quiet QSOs at (z) ~ 1.5 (Shanks 
&: Boyle 1994; Groom & Shanks 1996). This value is much larger than the clus- 
tering of optically-selected galaxies at z ~ 1, but this may not be unreasonable, 
since imaging of QSO hosts reveals them to be several-L* objects, comparable in 
stellar mass to radio galaxies (e.g. Dunlop et al. 1993; Taylor et al. 1996). It is 
plausible that the clustering of these massive galaxies at z ~ 1 will be enhanced 
through exactly the same mechanisms that enhances the clustering of Lyman- 
limit galaxies at z ~ 3. Of course, this does not rule out more complex pictures 
based on ideas such as close interactions in rich environments being necessary to 
trigger AGN. However, as emphasised below, the mass and rareness of these ob- 
jects sets a minimum level of bias. It is to be expected that this bias will increase 
at higher redshifts, and so one would not expect quasar clustering to decline at 
higher redshifts. Indeed, it has been claimed that ^ either stays constant at the 
highest redshifts (Andreani & Cristiani 1992; Groom & Shanks 1996), or even 
increases (Stephens et al. 1997). 

Radio-source clustering at high redshifts has been detected only in projection. 
The FIRST survey has measTircd ilj{0) to high precision for a limit of 1 mJy at 
1.4 GHz (Gress et al. 1996). Their result detects clustering at separations between 
0.02 and 2 degrees, and is fitted by a power law: 

w{6) = 0.003 [6'/degrees]-^-^ (2.8) 

There had been earlier claims of detections of angular clustering, notably the 
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87GB survey (Loan, Lahav & Wall 1997), but these were of only bare significance 
(although, in retrospect, the level of clustering in 87GB is consistent with the 
FIRST measurement). Discussion of the 87GB and FIRST results in terms of 
Limber's equation has tended to focus on values of e in the region of 0. Cress 
et al. (1996) concluded that the w(9) results were consistent with the PN91 
value of To — 10h~^M.pc (although they were not very specific about e). Loan 
et al. (1997) measured w{l°) ~ 0.005 for a 5-GHz limit of 50 mJy, and inferred 
ro 12 Mpc for e = 0, falling to ro 9 Mpc for e = — 1. 

The reason for this strong degeneracy between ro and e is that ro parame- 
terizes the z = clustering, whereas the observations refer to a typical redshift 
of around unity. This means that ro{z = 1) can be inferred quite robustly to 
be about 7.5 h^^ Mpc, without much dependence on the rate of evolution. Since 
the strength of clustering for optical galaxies at z = 1 is known to correspond 
to the much smaller number of ro — 2h~^ Mpc (e.g. Le Fevre et al. 1996), we 
see that radio galaxies at this redshift have a relative bias parameter of close to 
3. The explanation for this high degree of bias is probably similar to that which 
applies in the case of QSOs: in both cases we are dealing with AGN hosted by 
rare massive galaxies. 



3. Formation and bias of high-redshift galaxies 

The challenge now is to ask how these results can be understood in cur- 
rent models for cosmological structure formation. It is widely believed that the 
sequence of cosmological structure formation was hierarchical, originating in a 
density power spectrum with increasing fluctuations on small scales. The large- 
wavelength portion of this spectrum is accessible to observation today through 
studies of galaxy clustering in the linear and quasilinear regimes. However, non- 
linear evolution has effectively erased any information on the initial spectrum 
for wavelengths below about 1 Mpc. The most sensitive way of measuring the 
spectrum on smaller scales is via the abundances of high-redshift objects; the 
amplitude of fluctuations on scales of individual galaxies governs the redshift 
at which these objects first undergo gravitational collapse. The small-scale am- 
plitude also influences clustering, since rare early- forming objects arc strongly 
correlated, as first realized by Kaiser (1984). It is therefore possible to use obser- 
vations of the abundances and clustering of high-redshift galaxies to estimate the 
power spectrum on small scales, and the following section summarizes the results 
of this exercise, as given by Peacock et al. (1998). 



(a) Press-Schechter apparatus 

The standard framework for interpreting the abundances of high-redshift 
objects in terms of structure-formation models, was outlined by Efstathiou & 
Rees (1988). The formalism of Press & Schechter (1974) gives away of calculating 
the fraction Fc of the mass in the universe which has collapsed into objects more 
massive than some limit M: 



V2a{M) 



Fc{> M,z) = l- erf 
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Here, a{M) is the rms fractional density contrast obtained by filtering the linear- 
theory density field on the required scale. In practice, this filtering is usually 
performed with a spherical 'top hat' filter of radius R, with a corresponding mass 
of ^TrpijR^ /3, where ph is the background density. The number 6^ is the linear- 
theory critical overdensity, which for a 'top-hat' overdensity undergoing spherical 
collapse is 1.686 - virtually independent of O. This form describes numerical 
simulations very well (see e.g. Ma &; Bertschinger 1994). The main assumption is 
that the density field obeys Gaussian statistics, which is true in most inflationary 
models. Given some estimate of Fc, the number cr{R) can then be inferred. Note 
that for rare objects this is a pleasingly robust process: a large error in Fc will 
give only a small error in (t(R), because the abundance is exponentially sensitive 
to a. 

Total masses are of course ill-defined, and a better quantity to use is the 
velocity dispersion. Virial equilibrium for a halo of mass M and proper radius r 
demands a circular orbital velocity of 

V! = 3.2 

r 

For a spherically collapsed object this velocity can be converted directly into a 
Lagrangian comoving radius which contains the mass of the object within the 
virialization radius (e.g. White, Efstathiou & Prenk 1993): 



2i/2[-tA/i00kms- 
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R/h-^ M.^c= -j^. (3.3) 

J^^/^l+Ze)V2/V6 

Here, Zc is the redshift of virialization; $7^ is the present value of the matter 

density parameter; fc is the density contrast at virialization of the newly-collapsed 
object relative to the background, which is adequately approximated by 

/, = 178/J7^6(z,), (3.4) 

with only a slight sensitivity to whether A is non-zero (Eke, Cole &; Frenk 1996). 
For isothermal-sphere haloes, the velocity dispersion is 

= Vc/V2. (3.5) 

Given a formation redshift of interest, and a velocity dispersion, there is then a 
direct route to the Lagrangian radius from which the proto-object collapsed. 

(6) Abundances and masses of high-redshift objects 

Three classes of high-redshift object can be used to set constraints on the 
small-scale power spectrum at high redshift: 

(1) Damped Lyman-a systems Damped Lyman-a absorbers are sys- 
tems with HI column densities greater than ~ 2 X 10^"^ m-2 (Lanzetta et al. 1991). 
If the fraction of baryons in the virialized dark matter halos equals the global 
value S7b, then data on these systems can be used to infer the total fraction of mat- 
ter that has collapsed into bound structures at high rcdshifts (Ma & Bertschinger 
1994, Mo & Miralda-Escude 1994; Kauffmann & Chariot 1994; Klypin et al. 1995). 
The highest measurement at (z) ~ 3.2 implies J7hi — 0-0025h~^ (Lanzetta et al. 
1991; Storrie-Lombardi, McMahon & Irwin 1996). If n^h^ = 0.02 is adopted, as 
a compromise between the lower Walker et al. (1991) nucleosynthesis estimate 
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and the more recent estimate of 0.025 from Tytler et al. (1996), then 

= ^ ~ 0.12/i (3.6) 

" 'b 

for these systems. In this case alone, an explicit value of h is required in order to 
obtain the collapsed fraction; h = 0.65 is assumed. 

The photoionizing background prevents virialized gaseous systems with cir- 
cular velocities of less than about 50 kms^^ from cooling efficiently, so that they 
cannot contract to the high density contrasts characteristic of galaxies (e.g. Ef- 
stathiou 1992). Mo & Miralda-Escude (1994) used the circular velocity range 
50 - 100 kms~^ (<T„ = 35 - 70 kms~^) to model the damped Lyman alpha 
systems. Reinforcing the photoionization argument, detailed hydrodynamic sim- 
ulations imply that the absorbers are not expected to be associated with very 
massive dark-matter haloes (Haehnelt, Steinmetz &: Ranch 1998). This assump- 
tion is consistent with the rather low luminosity galaxies detected in association 
with the absorbers in a number of cases (Le Brun et al. 1996). 

(2) Lyman-limit galaxies Stcidcl et al. (1996) identified star-forming 
galaxies between z = 3 and 3.5 by looking for objects with a spectral break red- 
wards of the U band. The treatment of these Lyman-limit galaxies in this paper 
is similar to that of Mo & Fukugita (1996), who compared the abundances of 
these objects to predictions from various models. Steidel et al. give the comoving 
density of their galaxies as 

N{n = 1) ~ 10-2-54 ( /j-i Mpc)-^ (3.7) 

This is a high number density, comparable to that of L* galaxies in the present 
Universe. The mass of L* galaxies corresponds to collapse of a Lagrangian region 
of volume ^ 1 Mpc^, so the collapsed fraction would be a few tenths of a per cent 
if the Lyman-limit galaxies had similar masses. 

Direct dynamical determinations of these masses arc still lacking in most 
cases. Steidel et al. attempt to infer a velocity width by looking at the equivalent 
width of the C and Si absorption lines. These are saturated lines, and so the 
equivalent width is sensitive to the velocity dispersion; values in the range 

0-^ ~ 180-320kms~^ (3.8) 

are implied. These numbers may measure velocities which are not due to bound 

material, in which case they would give an upper limit to Vc/V^ for the dark 
halo. A more recent measurement of the velocity width of the Ha emission line 
in one of these objects gives a dispersion of closer to 100 kms^^ (Pettini, private 
communication), consistent with the median velocity width for Lya of 140 kms^-^ 
measured in similar galaxies in the HDF (Lowenthal et al. 1997). Of course, these 
figures could underestimate the total velocity dispersion, since they are dominated 
by emission from the central regions only. For the present, the range of values 
cr^ = 100 to 320 kms"^ will be adopted, and the sensitivity to the assumed 
velocity will be indicated. In practice, this uncertainty in the velocity does not 
produce an important uncertainty in the conclusions. 

(3) Red radio galaxies An especially interesting set of objects are the 
reddest optical identifications of 1-mJy radio galaxies, for which deep absorption- 
line spectroscopy has proved that the red colours result from a well-evolved stellar 
population, with a minimum stellar age of 3.5 Gyr for 53W091 at 2; = 1.55 (Dun- 

Phil. Trans. R. Soc. Land. A (1997) 



10 



J. A. Peacock 



lop et al. 1996; Spinrad et al. 1997), and 4.0 Gyr for 53W069 at z = 1.43 (Dunlop 
1998; Dey et al. 1998). Such ages push the formation era for these galaxies back 
to extremely high redshifts, and it is of interest to ask what level of small-scale 
power is needed in order to allow this early formation. 

Two extremely red galaxies were found at z = 1.43 and 1.55, over an area 
1.68 X 10~^ sr, so a minimal comoving density is from one galaxy in this redshift 
range: 

N{n = 1) > lO"^-^'^ ( Mpc)-3. (3.9) 

This figure is comparable to the density of the richest Abell clusters, and is thus 
in reasonable agreement with the discovery that rich high-redshift clusters appear 
to contain radio-quiet examples of similarly red galaxies (Dickinson 1995). 

Since the velocity dispersions of these galaxies are not observed, they must 
be inferred indirectly. This is possible because of the known present-day Faber- 
Jackson relation for ellipticals. For 53W091, the large-aperture absolute magni- 
tude is 

Mviz = 1.55 I O = 1) ~ -21.62 - 5 log^o h (3.10) 

(measured direct in the rest frame). According to Solar-metallicity spectral syn- 
thesis models, this would be expected to fade by about 0.9 mag. between z = 1.55 
and the present, for an = 1 model of present age 14 Gyr (note that Bender 
et al. 1996 have observed a shift in the zero-point of the M — relation out to 
z = 0.37 of a consistent size). If we compare these numbers with the - My 
relation for Coma {m — M = 34.3 for h = 1) taken from Dressier (1984), this 
predicts velocity dispersions in the range 

(7^ = 222 to 292 km s~^ (3.11) 

This is a very reasonable range for a giant elliptical, and it adopted in the following 
analysis. 

Having established an abundance and an equivalent circular velocity for these 
galaxies, the treatment of them will differ in one critical way from the Lyman-a 
and Lyman-limit galaxies. For these, the normal Prcss-Schechter approach as- 
sumes the systems under study to be newly born. For the Lyman-a and Lyman- 
limit galaxies, this may not be a bad approximation, since they are evolving 
rapidly and/or display high levels of star- formation activity. For the radio galax- 
ies, conversely, their inactivity suggests that they may have existed as discrete 
systems at redshifts much higher than z ~ 1.5. The strategy will therefore be to 
apply the Press-Schechter machinery at some unknown formation redshift, and 
see what range of redshift gives a consistent degree of inhomogeneity. 

4. The small-scale fluctuation spectrum 

(a) The empirical spectrum 

Fig. 2 shows the a{R) data which result from the Press-Schechter analysis, 
for three cosmologies. The cr{R) numbers measured at various high redshifts have 
been translated to z = using the appropriate linear growth law for density 
perturbations. 

The open symbols give the results for the Lyman-limit (largest R) and 
Lyman-a (smallest R) systems. The approximately horizontal error bars show 
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Figure 2. The present-day linear fractional rms fluctuation in density averaged in spheres of 

radius R. The data points are Lyman-a galaxies (open cross) and Lyman-limit galaxies (open 
circles) The diagonal band with solid points shows red radio galaxies with assumed collapse 
redshifts 2,4,... 12. The vertical error bars show the effect of a change in abundance by a factor 2. 
The horizontal errors correspond to different choices for the circular velocities of the dark-matter 
haloes that host the galaxies. The shaded region at large R gives the results inferred from 
gala^xy clustering. The lines show CDM and MDM predictions, with a large-scale normalization 
of (78 = 0.55 for f2 = 1 or (T8 = 1 for the low-density models. 

the effect of the quoted range of velocity dispersions for a fixed abundance; the 
vertical errors show the effect of changing the abundance by a factor 2 at fixed 
velocity dispersion. The locus implied by the red radio galaxies sits in between. 
The different points show the effects of varying collapse redshift: Zc = 2,4, ... ,12 
[lowest redshift gives lowest cr{R)]. Clearly, collapse redshifts of 6 - 8 are favoured 
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for consistency with the other data on high-redshift galaxies, independent of the- 
oretical preconceptions and independent of the age of these galaxies. This level 
of power {(t[R] 2± 2 for i? c± 1 Mpc) is also in very close agreement with the 
level of power required to produce the observed structure in the Lyman alpha 
forest (Croft et al. 1998), so there is a good case to be made that the fluctu- 
ation spectrum has now been measured in a consistent fashion down to below 
Rc^lh-^ Mpc. 

The shaded region at larger R shows the results deduced from clustering data 
(Peacock 1997). It is clear an = 1 universe requires the power spectrum at small 
scales to be higher than would be expected on the basis of an extrapolation from 
the large-scale spectrum. Depending on assumptions about the scale-dependence 
of bias, such a 'feature' in the linear spectrum may also be required in order to 
satisfy the small-scale present-day nonlinear galaxy clustering (Peacock 1997). 
Conversely, for low-density models, the empirical small-scale spectrum appears 
to match reasonably smoothly onto the large-scale data. 

Fig. 2 also compares the empirical data with various physical power spectra. 
A CDM model (using the transfer function of Bardeen et al. 1986) with shape 
parameter F = = 0.25 is shown as a reference for all models. This appears to 
have approximately the correct shape, although it overpredicts the level of small- 
scale power somewhat in the low-density cases. A better empirical shape is given 
by MDM with D,h ~ 0.4 and Q,y ~ 0.3. However, this model only makes physical 
sense in a universe with high fi, and so it is only shown as the lowest curve in 
Fig. 2c, reproduced from the fitting formula of Pogosyan &; Starobinsky (1995; 
see also Ma 1996). This curve fails to supply the required small-scale power, by 
about a factor 3 in a; lowering Jlj^ to 0.2 still leaves a very large discrepancy. 
This conclusion is in agreement with e.g. Mo & Miralda-Escude (1994), Ma & 
Bertschingcr (1994), Ma et al. (1997) and Gardner et al. (1997). 

All the models in Fig. 2 assume n = 1; in fact, consistency with the COBE 
results for this choice of ag and O/i requires a significant tilt for flat low-density 
CDM models, n ~ 0.9 (whereas open CDM models require n substantially above 
unity). Over the range of scales probed by LSS, changes in n arc largely degenerate 
with changes in 0,h, but the small-scale power is more sensitive to tilt than to 
fi/i. Tilting the Q = 1 models is not attractive, since it increases the tendency for 
model predictions to lie below the data. However, a tilted low-S7 flat CDM model 
would agree moderately well with the data on all scales, with the exception of the 
'bump' around R ^ 30 Mpc. Testing the reality of this feature will therefore 
be an important task for future generations of redshift survey. 

(&) Collapse redshifts and ages for red radio galaxies 

Are the collapse redshifts inferred above consistent with the age data on the 
red radio galaxies? First bear in mind that in a hierarchy some of the stars in a 
galaxy will inevitably form in sub-units before the epoch of collapse. At the time 
of final collapse, the typical stellar age will be some fraction a of the age of the 
universe at that time: 

age = t{zohs) - t{zc) + at{zc). (4.1) 

We can rule out a = 1 (i.e. all stars forming in small subunits just after the big 
bang). For present-day ellipticals, the tight colour-magnitude relation only allows 
an approximate doubling of the mass through mergers since the termination of 
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Figure 3. The age of a galaxy at z = 1.5, as a function of its collapse redshift (assuming an 
instantaneous burst of star formation). The various lines show = 1 [solid]; open Q = 0.3 
[dotted]; flat fl = 0.3 [dashed]. In all cases, the present age of the universe is forced to be 14 
Gyr. 



star formation (Bower at al. 1992). This corresponds to cc ~ 0.3 (Peacock 1991). 
A non-zero a just corresponds to scaling the collapse redshift as 

apparent (1 + Zc) oc (1 - a)~'^^^, (4.2) 

since t oc (1 + at high redshifts for all cosmologies. For example, a galaxy 

which collapsed at z = 6 would have an apparent age corresponding to a collapse 
redshift of 7.9 for a = 0.3. 

Converting the ages for the galaxies to an apparent collapse redshift depends 
on the cosmological model, but particularly on Hq. Some of this uncertainty may 
be circumvented by fixing the age of the universe. After all, it is of no interest to 
ask about formation redshifts in a model with e.g. fl = 1, h = 0.7 when the whole 
universe then has an age of only 9.5 Gyr. If J7 = 1 is to be tenable then either 
h < 0.5 against all the evidence or there must be an error in the stellar evolution 
timescale. If the stellar timescales are wrong by a fixed factor, then these two 
possibilities are degenerate. It therefore makes sense to measure galaxy ages only 
in units of the age of the universe - or, equivalently, to choose freely an apparent 
Hubble constant which gives the universe an age comparable to that inferred for 
globular clusters. In this spirit, Fig. 3 gives apparent ages as a function of efi'ective 
collapse redshift for models in which the age of the universe is forced to be 14 
Gyr (e.g. Jimenez et al. 1996). 

This plot shows that the ages of the red radio galaxies are not permitted very 
much freedom. Formation redshifts in the range 6 to 8 predict an age of close to 
3.0 Gyr for f2 = 1, or 3.7 Gyr for low-density models, irrespective of whether A is 
nonzero. The age-2;c relation is rather flat, and this gives a robust estimate of age 
once we have some idea of Zc through the abundance arguments. It is therefore 
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rather satisfying that the ages inferred from matching the rest-frame UV spectra 
of these galaxies are close to the above figures. 

(c) The global picture of galaxy formation 

It is interesting to note that it has been possible to construct a consistent 
picture which incorporates both the large numbers of star-forming galaxies at 
z < 3 and the existence of old systems which must have formed at very much 
larger redshifts. A recent conclusion from the numbers of Lyman-limit galaxies 
and the star-formation rates seen at z ~ 1 has been that the global history of star 
formation peaked at z ~ 2 (Madau et al. 1996). This leaves open two possibilities 
for the very old systems: either they arc the rare precursors of this process, and 
form unusually early, or they arc a relic of a second peak in activity at higher 
redshift, such as is commonly invoked for the origin of all spheroidal components. 
While such a bimodal history of star formation cannot be rejected, the rareness of 
the red radio galaxies indicates that there is no difficulty with the former picture. 
This can be demonstrated quantitatively by integrating the total amount of star 
formation at high redshift. According to Madau et al.. The star-formation rate 
at 2; = 4 is 

/j* ~ W-^h Mq Gyr-i Mpc'^, (4.3) 

declining roughly as (1 + z)~^. This is probably a underestimate by a factor of at 
least 3, as indicated by suggestions of dust in the Lyman-limit galaxies (Pettini 
et al. 1997), and by the prediction of Pei & Fall (1995), based on high-z element 
abundances. If we scale by a factor 3, and integrate to find the total density in 
stars produced at 2; > 6, this yields 

p^zf > 6) ~ IO^-^Mq Mpc-3. (4.4) 

Since the red mJy galaxies have a density of 10~^'^^/i^Mpc~^ and stellar masses 
of order 10^-*^ Mq, there is clearly no conflict with the idea that these galaxies are 
the first stellar systems of L* size which form en route to the general era of star 
and galaxy formation. 

(d) Predictions for biased clustering at high redshifts 

An interesting aspect of these results is that the level of power on 1-Mpc 
scales is only moderate: a{l Mpc) ~ 2. At z ~ 3, the corresponding figure 
would have been much lower, making systems like the Lyman-limit galaxies rather 
rare. For Gaussian fluctuations, as assumed in the Press-Schechter analysis, such 
systems will be expected to display spatial correlations which are strongly biased 
with respect to the underlying mass. The linear bias parameter depends on the 
rareness of the fluctuation and the rms of the underlying held as 

6=1 + ^ = 1 + ^ (4.5) 

vcr be 

(Kaiser 1984; Cole & Kaiser 1989; Mo & White 1996), where v = 6c/a, and cr^ is 
the fractional mass variance at the redshift of interest. 

In this analysis, 5c = 1.686 is assumed. Variations in this number of order 10 
per cent have been suggested by authors who have studied the flt of the Press- 
Schechter model to numerical data. These changes would merely scale 6 — 1 by 
a small amount; the key parameter is u, which is set entirely by the collapsed 
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Figure 4. The bias parameter a,t z = 3.2 predicted for the Lyman-limit galaxies, as a function 
of their assumed circular velocity. Dotted line shows ft = 0.3 open; dashed line is ft = 0.3 flat; 
solid line is ft = 1. A substantial bieis in the region of 6 ~ 6 is predicted rather robustly. 

fraction. For the Lyman-limit galaxies, typical values of this parameter are ~ 3, 
and it is clear that very substantial values of bias are expected, as illustrated in 
Fig. 4. 

This diagram shows how the predicted bias parameter varies with the as- 
sumed circular velocity, for a number density of galaxies fixed at the level ob- 
served by Steidel et al. (1996). The sensitivity to cosmological parameter is only 
moderate; ai Vc = 200 kms~^, the predicted bias is 5 ~ 4.6, 5.5, 5.8 for the 
open, flat and critical models respectively. These numbers scale approximately as 
V-^-'^, and b is within 20 per cent of 6 for most plausible parameter combinations. 
Strictly, the bias values determined here are upper limits, since the numbers of 
collapsed haloes of this circular velocity could in principle greatly exceed the 
numbers of observed Lyman-limit galaxies. However, the undercounting would 
have to be substantial: increasing the collapsed fraction by a factor 10 reduces 
the implied bias by a factor of about 1.5. A substantial bias seems difficult to 
avoid, as has been pointed out in the context of CDM models by Baugh et al. 
(1998). 

Comparing the bias values in Fig. 4 with those observed directly (section 
2b), we see that the observed value of b is quite close to the prediction in the 
case of = 1 - suggesting that the simplest interpretation of these systems 
as collapsed rare peaks may well be roughly correct. Indeed, for high circular 
velocities there is a danger of exceeding the predictions, and it would create 
something of a difficulty for high-density models if a velocity as high as Vc ^ 
300 kms~^ were to be established as typical of the Lyman-limit galaxies. For 
low Q, the 'observed' bias falls faster than the predictions, so there is less danger 
of conflict. For a circular velocity of 200 kms~^, we would need to say that the 
collapsed fraction was underestimated by roughly a factor 10 (i.e. increase the 
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multipole number J2 

Figure 5. The angular power spectrum for the temperature fluctuations in the CMB. These pre- 
dictions fix the matter power spectrum today to have the shape inferred from galaxy clustering 
and the normalization inferred from the abundance of rich clusters. The 'observed' values are 
adopted for the other cosmological parameters: h = 0.65, Q^/^ = 0.1. At long wavelengths, 
where no galaxy clustering data exist, the spectrum is assumed to be scale invariant; failure 
to match COBE thus indicates that tilt is required. However, the power at £ ^ 2000 is nearly 
spectrum independent, since this is where the normalization scale sits. The rejection of open 
models is thus very nearly model independent. 

values of a in Fig. 2 by a factor of about 1.5) in order to lower the predicted 
bias sufficiently, either by postulating that the conversion from velocity to R is 
systematically in error, or by suggesting that there may be many haloes which are 
not detected by the Lyman-limit search technique. It is hard to argue that either 
of these possibilities are completely ruled out. Nevertheless, we have reached the 
paradoxical conclusion that the observed large-amplitude clustering at z = 3 is 
more naturally understood in an O = 1 model, whereas one might have expected 
the opposite conclusion. 



5. Empirical predictions for CMB anisotropies 

The recurring theme of this paper has been that it is now possible to mea- 
sure the fluctuation spectrum empirically to an interesting precision. On large 
scales, this is possible using galaxy clustering to give the shape of the spectrum, 
with the cluster abundance giving the normalization. On small scales, we have 
seen how information on high-redshift galaxies gives answers that are reason- 
ably consistent with extrapolation of the large-scale results. This situation is to 
be contrasted with the normal approach to measurements of CMB anisotropies, 
where the results are fitted by variants on CDM models, adjusting the parameters 
{^Im, ^v,^B, h, n). If CMB data alone are considered, many combinations of these 
parameters can fit existing results; however, in many cases the predicted z = 
matter fluctuation spectrum will be in gross disagreement with observation. 
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This problem is often tackled by requiring acceptable models to fit some 
statistic such as cjg. However, an alternative route is to recall that the CMB 
calculations are entirely linear, and that they are based on the evolution of a 
given Fourier mode from last scattering at z ~ 1100 to the present. The equations 
involved are time-symmetric, so there is no reason why the integration cannot be 
carried out backwards. If we believe that the amplitude of gravitational potential 
fluctuations at z = has been measured as a function of scale, then it makes sense 
to place these fluctuations at last scattering and deduce an empirical prediction 
of the CMB fluctuations. In practice, this can be achieved by a process which 
resembles 'designer inflation': assume a suitable fluctuation spectrum at z > 1100 
such that any features in the transfer function are cancelled, leaving the desired 
power spectrum at z = 0. Described in this way, the process sounds unnatural; 
however, the standard lore suggests that perturbations are generated at z ~ 10^®, 
so there is much more room at z > 1100 for unknown extra physics than there is 
at z < 1100. 

This approach still leaves free the global cosmological parameters. The CMB 
results clearly depend on Q^n and r2„, since the inferred fluctuation spectrum 
depends on these (although only weakly on Qv)- The other parameters can be 
fixed at their empirical values, taken here to be /i = 0.65 and Q-g/^ = 0.1. For an 
extreme empirical approach, no power would be assumed beyond the largest scale 
at which clustering is observed in the galaxy distribution {k ~ 0.02 /iMpc^^). A 
reasonable alternative, adopted here, is to allow the spectrum to vary with some 
power-law index n on larger scales. Finally, the collisionless dark matter is taken 
to be cold and the fluctuations are assumed to be isentropic; variations of either 
of these assumptions would lead to larger fluctuations. 

The results of this calculation are shown in Fig. 5, which was generated using 
a modified form of the CMBFAST code of Seljak & Zaldarriaga (1996). This looks 
quite different from the plots usually seen in this field, for the following reasons: 

(1) The normalization comes direct from the power spectrum; no attempt 
has been made to fit the CMB data. 

(2) The models arc thus not COBE normalized, although they could be made 
to fit COBE by adjusting the large-wavelength index n. Open models would 
require n > 1, flat models n < 1. 

(3) Adjusting n in this way only affects Ce for £ < 300, since larger multipoles 
project to parts of k space probed by galaxy clustering. 

The last point is especially important, since it means that it is the high-£ clus- 
tering where robust predictions can be made. The k ~ 0.2/iMpc~^ waves that 
determine as project to £ ~ 1200 for O = 1, or to £ ~ 1200/il and 1200/fi°-'' for 
open and flat models respectively. This difference in angular-diameter distance is 
one half of the reason why the predictions for open models in Fig. 5 are so much 
higher than the predictions for flat models with the same parameters. The other 
difference is the difference in linear growth suppression factors, which amounts 
roughly to a factor fi°-^ (equation 1.1). Since the present-day power spectrum 
and its normalization is highly insensitive to A, there is thus a very simple recipe 
for predicting the CMB anisotropics for a given open model: calculate the corre- 
sponding flat case, boost the results by a factor and translate the spectrum 
to higher ^ by a factor 0"°-^. This recipe fails for the lowest multipoles, where 
spatial curvature is important. However, the critical £ ~ 1000 results follow this 
scaling almost exactly. 
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The conclusion is therefore that, for any flat model which roughly fits the 
CMB data, its open counterpart will be grossly in error, and this is just what is 
seen in practice. Flat models with $7 > 0.3 are acceptable, but open models are 
qualitatively wrong unless 0, > 0.5. It is interesting to note that this conclusion 
comes not so much from the modern data at £ ~ 200, but from the long-standing 
OVRO upper limit at £ ~ 2000. The inconsistency of this result with most open 
models was noted by Bond &: Efstathiou (1984), and all that has changed since 
then is that we now prefer to use the cluster normalization, rather than the 
unbiased normalization chosen by Bond &: Efstathiou. This raises the amplitude 
for low-fi models, making it that much harder for them to get anywhere near to 
the data. 



6. Conclusions 

The data on the abundances and clustering of both radio-loud and radio- 
quiet galaxies at high redshift appear to be in good quantitative agreement with 
the expectation of models in which structure formation proceeds through hier- 
archical merging of haloes of dark matter. Furthermore, the existing data yield 
an empirical measurement of the fluctuation spectrum on sub-Mpc scales. In 
general, this small-scale spectrum is close to what would be expected from an 
extrapolation of LSS measurements, but there are deviations in detail: = 1 
places the small-scale data somewhat above the LSS extrapolation, whereas open 
low-Q models suffer from the opposite problem; low-il A-dominated models fare 
somewhat better, especially with a slight tilt. These last models also account well 
for the i ~ 1000 CMB anisotropics if the dark matter is assumed to be pure 
CDM, normalized to COBE (whereas open models fail badly). Until recently, it 
appeared that geometrical tests such as the supernova Hubble diagram (Perlmut- 
ter et al. 1997) or gravitational lensing (Carroll, Press & Turner 1992; Kochanek 
1996) were strongly inconsistent with A-dominatcd models, so the overall situa- 
tion was badly confused. However, with recent developments in these areas now 
appearing to favour a nonzero A (Garnavich et al. 1998; Chiba & Yoshii 1997), 
it is possible that a consistent picture may be emerging. 

The main remaining difficulty for ACDM lies in the shape of the large-scale 
power spectrum measured from the APM survey around k = 0.1 hMpc^^. This is 
a region of the spectrum which is well within the capability of 2dF and Sloan, so 
we can confidently expect this problem to be either confirmed or removed within 
the next few years. The subject of structure formation thus stands at a critical 
point: either we are close to having a 'standard model' for galaxy formation and 
clustering, or we may have to accept that radical new ideas are needed. At the 
current rate of observational progress, the verdict should not be very far away. 
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